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Abstract. We prove new lower bounds for bounded error quantum communication complexity. 
Our methods are based on the Fourier transform of the considered functions. First we generalize 
a method for proving classical communication complexity lower bounds developed by Raz pS to 
the quantum case. Applying this method we give an exponential separation between bounded error 
quantum communication complexity and nondeterministic quantum communication complexity. We 
develop several other lower bound methods based on the Fourier transform, notably showing that 
\J *(/)/ l°g n i f° r the average sensitivity s(f) of a function /, yields a lower bound on the bounded 
error quantum communication complexity of f(x A y 93 z), where a; is a Boolean word held by Alice 
and y, z are Boolean words held by Bob. We then prove the first large lower bounds on the bounded 
error quantum communication complexity of functions, for which a polynomial quantum speedup 
is possible. For all the functions we investigate, the only previously applied general lower bound 
method based on discrepancy yields bounds that are O(logn). 
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1. Introduction. Quantum mechanical computing and communication has been 
studied extensively during the last decade. Communication has to be a physical pro- 
cess, so an investigation of the properties of physically allowed communication is 
desirable, and the fundamental theory of physics available to us is quantum mechan- 
ics. 

The theory of communication complexity deals with the question how efficient 
communication problems can be solved, and has various applications to lower bound 
proofs for other resources (an introduction to (classical) communication complexity 
can be found in pc|). 

In a quantum protocol (as defined in p9| ) two players Alice and Bob each receive 
an input, and have to compute some function defined on the pair of inputs coopera- 
tively. To this end they exchange messages consisting of qubits, until the result can 
be produced from some measurement done by one of the players (for surveys about 
quantum communication complexity see J3?], |24|). 

It is well known that quantum communication protocols can sometimes be sub- 
stantially more efficient than classical probabilistic protocols: The most prominent 
example of such a function is the disjointness problem DISJ n , in which the players 
receive incidence vectors x,y of subsets of {f, . . . ,n}, and have to decide whether 
the sets are not disjoint: \/(xi A y{). By an application of Grover's search algorithm 
||l8[ to communication complexity given in jl(| an upper bound of 0(y/n log n) holds 
for the bounded error quantum communication complexity of DISJ n . Recently this 
upper bound has been improved to O(y/nc los n ) in The classical bounded error 
communication complexity of DISJ n on the other hand is Q(n) p^ |. The quantum 
protocol for DISJ n yields the largest gap between quantum and classical communica- 
tion complexity known so far for a total function. For partial functions and so-called 
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sampling problems even exponential gaps between quantum and classical communi- 
cation complexity are known, see |3g, |To[ g]. 

Unfortunately so far only few lower bound methods for quantum communication 
complexity are known: the logarithm of the rank of the communication matrix is 
known as a lower bound for exact (i.e., errorless) quantum communication [fl0| , [TT| , the 
(in applications often weak) discrepancy method can be used to give lower bounds for 
protocols with error p7fl . Another method for protocols with bounded error requires 
lower bounds on the minimum rank of matrices approximating the communication 
matrix A very recent result by Razborov (published subsequently to this 

paper) implies such lower bounds for a limited class of functions, previously such 
results were unknown. In this paper we introduce several lower bound methods for 
bounded error quantum communication complexity exploiting algebraic properties of 
the communication matrix. 

Let IP n denote the inner product modulo 2 function, i.e., 

IP n (x,y) = 0(xiAt/i). 

i=l 

Known results about the discrepancy of the inner product function under the 
uniform distribution then imply that quantum protocols for IP n with error 1/2— e have 
complexity Vt(n/2 — log(l/e)), see [|?] (actually only a linear lower bound assuming 
constant error is proved there, but minor modifications give the stated result). The 
inner product function appears to be the only explicit function, for which a large lower 
bound on the bounded error quantum communication complexity has been published 
prior to this paper. 

We prove new lower bounds on the bounded error quantum communication com- 
plexity of several functions. These bounds are exponentially bigger than the bounds 
obtainable by the discrepancy method. Note that we do not consider the model of 
quantum communication with prior entanglement here (which is defined in ) . 

Our results are as follows. First we generalize a lower bound method developed by 
Raz [[35] for classical bounded error protocols to the quantum case. The lower bound 
is given in terms of the sum of absolute values of selected Fourier coefficients of the 
function. To be able to generalize this method we have to decompose the quantum 
protocol into a "small" set of weighted monochromatic rectangles, so that the sum 
of these approximates the communication matrix. Opposed to the classical case the 
weights may be negative, but all weights have absolute value at most 1. 

Applying the method we get a lower bound of f2(n/rogn) for the bounded error 
quantum communication complexity of the Boolean function HAM„^ 2 , where 

HAM t n {x,y) = l dist{x lV )^t <S=^ ^{x, © y t ) ^ t, 

i 

for binary strings x, y of length n and the Hamming distance dist. We then show, 
using methods of de Wolf [Q, that the nondeterministic (i.e., one-sided unbounded 
error) quantum communication complexity of HAM^J" 1 is O(logn). So we get an 
exponential gap between the nondeterministic quantum and bounded error quantum 
complexities. Since it is also known that the equality function EQ n has (classical) 
bounded error protocols with O(logn) communication pa ], while its nondeterministic 
quantum communication complexity is Q(n) p8| , we get the following separation.^] 



1 het BQP denote the bounded error quantum communication complexity, NQP the nondeter- 
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Corollary 1.1. There are total Boolean functions HAM^ 2 , EQ n on 2n inputs 
each, such that 

1. NQC(HAM'n 2 ) = O(logn) and BQC(HAMn /2 ) = n(n/\ogn), 

2. BQC(EQ n ) = O(logn) and NQC(EQ n ) = O(n). 

Furthermore we give quite tight lower and upper bounds for HAM^ for general 
values of t, establishing that bounded error quantum communication does not give 
a significant speedup compared to classical bounded error communication for these 
problems. 

We then turn to several other techniques for proving lower bounds, which are also 
based on the Fourier transform. We concentrate on functions f(x, y) = g(x o y), for 
o € {A,®}, the bitwise conjunction and parity operators. We prove that for o = A, if 
we choose any Fourier coefficient g z of g, then \z\/(l — log|^|) yields a lower bound 
on the bounded error quantum communication complexity of /. Averaging over all 
coefficients leads to a bound given by the average sensitivity of g divided by the 
entropy of the squared Fourier coefficients. We then show another bound for o = © 
in terms of the entropy of the Fourier coefficients and obtain a result solely in terms 
of the average sensitivity by combining both results. 

Corollary 1.2. For all functions f, so that both g(x A y) and g(x © y) with 
g : {0, 1}" -> {0, 1} reduce to f: 

If e.g. f(x, y, z) — g(x A y © z), with x held by Alice and y, z held by Bob, the 
required reductions are trivial. For many functions, e.g. g = MAJ n , it is easy to 
reduce g(x © y) on 2 • n inputs directly to g(x A y) on more inputs using Xi © y% = 
-^Xi A yi + Xi A -it/j (plus the addition of a few dummy variables), and so the lower 
bound of corollary 1.2 can sometimes be used for g(x A y). 

We then modify the lower bound methods, and show how we may replace the 
Fourier coefficients by the singular values of the communication matrix (divided by 
2"). This means that we may replace the Fourier transform by other unitary trans- 
forms and sometimes get much stronger lower bounds. 

Application of the new methods to the Boolean function 

MAJ n (x, y) = l <^=> ^(rr, A Vi ) > n/2 

i 

yields a lower bound of fi(n/logn) for its bounded error quantum communication 
complexity. MAJ n is a function, for which neither bounded error quantum nor non- 
deterministic quantum protocols are efficient, while the discrepancy bound is still only 
O(logn). 

We then apply the same approach to 

COUNT t n (x,y) = l ^2(xiA yi )=t. 

i 

These functions have a classical complexity of 0(n) for all t < n/2, since one can 
easily reduce the disjointness problem to these functions {DISJ n is the complement 
of COUNT^). We show the following: 



ministic quantum communication complexity, QC the weakly unbounded error quantum communi- 
cation complexity (see section 2.2 for definitions). 
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Corollary 1.3. 

Sl(n l ~ c /logn) < BQC(COUNT£~*) < Oin 1 -^ 2 logn). 

BPC(COUNT^) = 6(n) /or a// 1 < n/2. 

These are the first lower bounds for functions which allow a polynomial quantum 
speedup. 

Prior to this paper the only known general method for proving lower bounds 
for the bounded error quantum communication complexity has been the discrepancy 
method. We show that for any application of the discrepancy bound to HAM^, MAJ n , 
and COUNT^, the result is only O(logn). To do so we characterize the discrepancy 
bound within a constant multiplicative factor and an additive log-factor as the classi- 
cal weakly unbounded error communication complexity PC (see sections 2.2/2.4 for 
definitions). 

Corollary 1.4. For all f : {0, 1}" x {0, 1}" -► {0, 1} : 

maxlog(l/ disc J f)) < 0{PC{f)) < 0(maxlog(l/ disc J f)) + logn). 

This explains why the discrepancy bound is usually in applications not a good 
lower bound for bounded error communication complexity, since the weakly un- 
bounded error complexity is always asymptotically at most as large as e.g. the clas- 
sical nondeterministic complexity. For our examples the new lower bound methods 
are exponentially better than the discrepancy bound. In the light of corollary 1.4 it 
becomes clear that actually lower bounds using discrepancy follow the approach of 
simulating quantum bounded error protocols by classical unbounded error protocols 
and subsequent application of a classical lower bound. 

We conclude also that the discrepancy bound subsumes other methods for proving 
lower bounds on the weakly unbounded error communication complexity []l5|| . Fur- 
thermore we investigate quantum protocols with weakly unbounded error and show 
that quantum and classical weakly unbounded error communication complexity are 
asymptotically equivalent. 

The organization of the paper is as follows. In section 2 we describe the necessary 
technical background. Section 3 shows how we can decompose quantum protocols 
into weighted rectangle covers of the communication matrix. Sections 4 and 6 then 
describe our main lower bounds techniques, while sections 5 and 7 show how to apply 
these to specific functions and derive corollaries 1.1 and 1.3. Section 8 is concerned 
with the power of classical and quantum weakly unbounded error protocols. Section 
9 discusses recent developments and open problems. 

2. Preliminaries. Note that we consider functions with range {0, 1} as well as 
with range {—1, 1}. If a result is stated for functions with range {0, 1} then it also 
holds for { — 1,1}. Some results are stated only for functions with range { — 1,1}. The 
communication complexity does not depend on that choice, so this means that certain 
parameters in the lower bounds are dependent on the range. 

2.1. Quantum States and Transformations. Quantum mechanics is usually 
formulated in terms of states and transformations of states. See (3l) for general 
information on this topic with an orientation on quantum computing. 
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In quantum mechanics pure states are unit norm vectors in a Hilbert space, usually 
C fe . We use the Dirac notation for pure states. So a pure state is denoted \4>) or 
Exe{o,...,fe-i} °^\ x ) with Ea:e{o,...,fc-i} Kl 2 = 1 and with {\x)\x e {0,...,k- 1}} 
being an orthonormal basis of C fc . 

Inner products in the Hilbert space are denoted ((f>\ip). 

If k = 2 l then the basis is also denoted { \x) \x £ {0, 1}'}. In this case the space 
(D 2 is the Z-wise tensor product of the space (D 2 . The latter space is called a qubit, 
the former space consists of I qubits. 

As usual measurements of observables and unitary transformations are considered 
as basic operations on states, see |3l| for definitions. 

2.2. The Communication Model. Now we provide definitions of the com- 
putational models considered in the paper. We begin with the model of classical 
communication complexity. 

Definition 2.1. Let f : X xY ^ {0,1} be a function. In a communication pro- 
tocol player Alice and Bob receive x € X and y € Y and compute fix, y). The players 
exchange binary encoded messages. The communication complexity of a protocol is the 
worst case number of bits exchanged for any input. The deterministic communication 
complexity DC(f) of f is the complexity of an optimal protocol for f. 

In a randomized protocol both players have access to private random bits. In the 
bounded error model the output is required to be correct with probability 1 — e for some 
constant 1/2 > e > 0. The bounded error randomized communication complexity of 
a function BPC e (f) is then defined analogously to the deterministic communication 
complexity. We set BPC(f) = BPC 1/3 (f). 

In a weakly unbounded error protocol the output has to be correct with probability 
exceeding 1/2. If the worst case error of the protocol (over all inputs) is 1/2 — S 
and the worst case communication is c, then the cost of the protocol is defined as 
c — [log 8 J . The cost of an optimal weakly unbounded error protocol for a function is 
called PC [f). 

Definition 2.2. Let us note that the communication matrix of a function f : 
X xY — > Z is the matrix with rows labeled by x € X , columns labeled by y £ Y , and the 
entry in row x and column y equal to f(x,y) S Z . A rectangle in the communication 
matrix is a product set of inputs labeled by Ax B with 4CI and B C Y '. Such a 
rectangle is monochromatic, iff all its entries are equal. 

It is easy to see that a deterministic protocol partitions the communication matrix 
into a set of monochromatic rectangles, each corresponding to the set of inputs sharing 
the same communication string produced in the run of the protocol. 

The above notion of weakly unbounded error protocols coincides with another 
type of protocols, namely majority nondctcrministic protocols, which accept an input, 
whenever there are more nondeterministic computations leading to acceptance than 
to rejection. For a proof see theorem 10 in |Ts|| . So weakly unbounded error protocols 
correspond to certain majority covers for the communication matrix as follows: 

Fact 2.3. There is a weakly unbounded error protocol with cost 0(c), iff there is 
a set 2 ( c ) rectangles each labeled either 1 or 0, such that for every input at least one 
half of the adjacent rectangles have the label f(x,y). 

Note that there is another type of protocols, truly unbounded error protocols, 
in which the cost is not dependent on the error, defined by Paturi and Simon |3^| . 
Recently a linear lower bound for the unbounded error communication complexity of 
IP n has been obtained in Jlal. It is not hard to see that the same bound holds for 
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quantum communication as well. An interesting observation is that that the lower 
bound method of |]l6f is actually equivalent to the discrepancy lower bound restricted 
to the uniform distribution. 

Now we turn to quantum communication protocols. For a more formal definition 
of quantum protocols see |39[| . 

Definition 2.4. In a quantum protocol both players have a private set of qubits. 
Some of the qubits are initialized to the input before the start of the protocol, the 
other qubits are in state |0). In a communication round one of the players performs 
some unitary transformation on the qubits in his possession and then sends one of his 
qubits to the other player (the latter step does not change the global state but rather 
the possession of individual qubits). The choices of the unitary operations and of the 
qubit to be sent are fixed in advance by the protocol. 

At the end of the protocol the state of some qubit belonging to one player is mea- 
sured and the result is taken as the output and communicated to the other player. The 
communication complexity of the protocol is the number of qubits exchanged. 

In a (bounded error) quantum protocol the correct answer must be given with 
probability 1 — e for some 1/2 > e > 0. The (bounded error) quantum complexity of a 
function, called BQC e (f), is the complexity of an optimal protocol for f . BQC(f) = 
BQC 1/3 (f). 

In a weakly unbounded error quantum protocol the output has to be correct with 
probability exceeding 1/2. // the worst case error of the protocol (over all inputs) is 
1/2 — S and the worst case communication is c, then the cost of the protocol is defined 
as c — \ \og S\ . The cost of an optimal weakly unbounded error protocol for a function 
is called QC(f). 

In a nondeterministic quantum protocol for a Boolean function f all inputs in 
/ _1 (0) have to be rejected with certainty, while all other inputs have to be accepted 
with positive probability. The corresponding complexity is denoted NQC(f). 

We have to note that in the defined model no intermediate measurements are 
allowed to control the choice of qubits to be sent or the time of the final measure- 
ment. Thus for all inputs the same amount of communication and the same number of 
message exchanges is used. As a generalization one could allow intermediate measure- 
ments, whose results could be used to choose (several) qubits to be sent and possibly 
when to stop the communication protocol. One would have to make sure that the 
receiving player knows when a message ends. While the model in our definition is in 
the spirit of the "interacting quantum circuits" definition given by Yao |3£j , the latter 
definition would more resemble "interacting quantum Turingmachines" . Obviously 
the latter model can be simulated by the former such that in each communication 
round exactly one qubit is communicated. All measurements can then be deferred to 
the end by standard techniques. This increases the overall communication by a factor 
of 2 (and the number of message exchanges by a lot). 

2.3. Fourier Analysis. We consider functions / : {0, 1}™ — - > 1R. Define 

as inner product and use the norm ||/||2 = y (f, /)■ We identify {0, 1}™ with and 
describe the Fourier transform. A basis for the space of functions from — * H is 
given by 



Xz (x) = (-l)^(^) 
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for all z £ Then the Fourier transform of / with respect to that basis is 

^ t fzXzj 

z 

where the f z = (/, Xz) are called the Fourier coefficients of /. If the functions are 
viewed as vectors, this is closely related to the Hadamard transform used in quantum 
computing. 

The following facts are well-known. 

Fact 2.5 (Parseval). For all f: ||/|| = J2 Z fz- 

Fact 2.6 (Cauchy-Schwartz). 

z z \ z 

When we consider (communication) functions / : x Zg — > IR, we use the basis 
functions 

Xz,z'{x,x') = (_i)«'»(*.*)+« , »(*V) 

for all z, z' G Zj x Z2 in Fourier transforms. The Fourier transform of / with respect 
to that basis is 

^ fz,z'Xz,z' j 
z,z' 

where the f ZtZ i — (f,Xz,z') are the Fourier coefficients of /. 

We will decompose communication protocols into sets of weighted rectangles. 
For each rectangle R4 = A l x B l C {0,1}" x {0,1}™ let R l ,A l ,B l also denote the 
characteristic functions associated to the rectangle. Then let on = \Ai\/2 n be the 
uniform probability of x being in the rectangle, and (it — \Bi\/2 n be the uniform 
probability of y being in the rectangle. Let a z .i denote the Fourier coefficients of Ai 
and $ z ,i the Fourier coefficients of It is easy to see that a z ^ -$ z '^ is the z, z'-Fourier 
coefficient of the rectangle function Ri. 

For technical reasons we will sometimes work with functions /, whose range is 
{ — 1,1}. Note that we can set / = 2g — 1 for a function g with range {0, 1}. Since 
the Fourier transform is linear, the effect on the Fourier coefficients is that they get 
multiplied by 2 except for the coefficient of the constant basis function, which is also 
decreased by 1. 

2.4. Discrepancy, Sensitivity, and Entropy. We now define the discrepancy 
bound. 

Definition 2.7. Let fi be any distribution on {0, 1}™ x {0, 1}" and f be any 
function f : {0, 1}" x {0, 1}" -> {0, 1}. Then let 

di S c M (/) = max| M (i?n/- 1 (0))- M (i?nr 1 (i))|, 

IX 

where R runs over all rectangles in the communication matrix of f . 
Then denote disc(f) = min^ disc^(f). 

The application to communication complexity is as follows (see [p7| for a less 
general statement, we also provide a proof for completeness at the end of section 3): 
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Fact 2.8. For all f: 

BQC 1/2 _ e (f) = n(log(e/disc(f))). 

A quantum protocol which computes a function f correctly with probability 1/2 + 
e over a distribution [i on the inputs ( and over its measurements ) needs at least 
ri(log(e/c?isc M (/))) communication. 

We will prove a lower bound on quantum communication complexity in terms of 
average sensitivity. The average sensitivity of a function measures how many of the n 
possible bit flips in a random input change the function value. We define this formally 
for functions with range {—1,1}. 

Definition 2.9. Let f : {0,1}" {-1,1} be a function. For a <E {0,1}™ let 
s a{f) = Y^j—i \\f{ a ) ~ f( a © e i)\ f or the vector containing a one at position i and 
zeroes elsewhere. s a (f) is the sensitivity of f at a. Then the average sensitivity of f 
is defined s(f) = £ ae {o,i}" ^s a {f). 

The connection to Fourier analysis is made by the following fact first observed in 

& 

Fact 2.10. For all f : {0, 1}" -> {-1, 1} : 

*(/)= E i*i 

ze{o,i}" 



So the average sensitivity can be expressed in terms of the expected "height" of 
Fourier coefficients under the distribution induced by the squared coefficients. 
One more notion we will use in lower bounds is entropy. 

Definition 2.11. The entropy of a vector (/i, . . . , f m ) with fi> for all i and 

£/<<i "#(/) = - E£i/< kg 

We follow the convention log = 0. We will consider the entropy of the vector of 
squared Fourier coefficients H(f 2 ) = — £ z fl log(/^). This quantity has the following 
useful property. 

Lemma 2.12. For any f : {0, 1}™ -> TR with ||/|| 2 < 1 : 

H(f 2 ) <21og I 1+ E I A 

\ Z6{0,1}" 

Proof. 

= 2 (E/>g^ + (i-EA 2 )- lo gi) 

< 2 log [ E fz7J7 + (1 - E ' 1 ) ^ Jensen ' s inequality 
<21ogfl + ElAlj ■ 



□ 
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3. Decomposing Quantum Protocols. In this section we show how to decom- 
pose a quantum protocol into a set of weighted rectangles, whose sum approximates 
the communication matrix. 

Lemma 3.1. For all Boolean functions f : {0, 1}" x {0, l} n — > {0, 1}, and for all 
constants 1/2 > e > 0: 

If there is a quantum protocol for f with communication c and error 1/3, 

then there is a real a € [0,1], and a set of 2°^ rectangles Ri with weights 
Wi G {—a, a}, so that 



Y^WiRi[x,y] e | 



[l-e,l] forf(x,y) = l 
for f(x,y) = 0. 



Proof. First we perform the usual success amplification to boost the success 
probability of the quantum protocol to 1 — e/4, increasing the communication to 0(c) 
at most, since e is assumed to be a constant. Using standard techniques || we can 
assume that all amplitudes used in the protocol are real. Now we employ the following 
fact proved in (27| and pi. 

FACT 3.2. The final state of a quantum protocol exchanging c qubits on an input 
(x, y) can be written 

^2 ot m (x)f3 m (y)\A m (x))\m c )\B m (y)), 

me{0,l} c 

where \A m (x)}, \B m (y)) are pure states and a m (x) , m (y) are real numbers from the 
interval [—1,1]. 

Now let the final state of the protocol on (x, y) be 

oi m {x)f3 m (y) | A m (x)\m c ) \ B m (y)) , 

me{0,l} c 

and let 4>{x, y) — 

a. m \ (x)[3 m i (y)\A m i (x)} 1 1) \B ml (y)) 

me{0,l} c_1 

be the part of the state which yields output 1. The acceptance probability of the 
protocol on (x,y) is now the inner product (4>(x, y)\4>(x, y)). Using the convention 

a mp {x) = a ml {x)a p i(x)(A ml (x)\A p i(x)) , 



b mp (y) = (3 m i(y)P P i{y)(B ml (y)\B pl (y)) , 

this can be written as J2 m p a mp(x)b mp (y) . Viewing a mp and b mp as 2 n -dimensional 
vectors, and summing their outer products over all m,p yields a sum of 2°( c ) rank 
1 matrices containing reals between -1 and 1. Rewrite this sum as ^2nOCi0i with 
1 < i < 2°( c ) to save notation. The resulting matrix is an approximation of the 
communication matrix within componentwise error e/4. 

In the next step define for all i a set P a ^ of the indices of positive entries in 
«i, and the set N a ^ of the indices of negative entries of o^. Define Pp^ and Np t i 
analogously. We want to have that all rank 1 matrices either have only positive or 
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only negative entries. For this we split the matrices into 4 matrices each, depending 
on the positivity/negativity of a, and Let 

, J if i£ N a . t 

a iW-\ ai (x) ifxeP^ ' 

and analogously for (3'^ then set the positive entries in and fa to 0. Consider the 
sum X^(a;/3f) + Y^i( a 'iPl ) + X«( a j/^ T ) + Si( a i$ T )- This sum equals the previous 
sum, but here all matrices arc either nonncgative or nonpositive. Again rename the 
indices so that the sum is written J^i a iPl (to save notation). 

At this point we have a set of rank one matrices which are either nonnegative or 
nonpositive with the above properties. We want to round entries and split matrices 
into uniformly weighted matrices. Let C denote the number of matrices used until 
now. 

Consider the intervals [0,e/(16C) ], and [e/(16C) • fc,e/(16C) ■ (fc + 1) ], for all fc 
up to the least fc, so that the last interval includes 1. Obviously there are 0(C) such 
intervals. Round every positive a.i{x) and (3i(x) to the upper bound of the first interval 
it is included in, and change the negative entries analogously by rounding to the upper 
bounds of the corresponding negative intervals. The overall error introduced on an 
input (x, y) in the approximating sum ^\ ai(x)(3i(y) is at most 

i 

+ ^/3 J (y)-e/(16C) + C- e 2 /(16C) 2 

i 

< e/4. 

The sum of the matrices is now between 1 — e/2 and 1 + e/4 for inputs in / _1 (1) and 
between —e/4 and e/2 for inputs in / _1 (0). Add a rectangle with weight e/4 covering 
all inputs. Dividing all weights by 1 + e/2 renormalizes again without increasing the 
error beyond e. 

Now we are left with C rank 1 matrices otiflf containing entries from a 0{C) 
size set only. Splitting the rank 1 matrices into rectangles containing only the en- 
tries with one of the values yields 0(C 2 ) weighted rectangles, whose (weighted) sum 
approximates the communication matrix within error e. 

In a last step we replace any rectangle with an absolute weight value of e/(16C(l + 
e/2)) • k by fc rectangles with weights ±a for a = e/(16C(l + e/2)). The rectangle 
weighted e/4 can be replaced by a set of rectangles with weight a each, introducing 
negligible error. □ 

Now we show the analogous lemma, when we also want to improve the error 
probability beyond a constant. If we would simply decrease the error to l/2 d by 
repeating the protocol before constructing the cover, then we would be forced to work 
with high precision in all steps, increasing the size of the cover to 2°^ c+d "> d \ which 
is undesirable for large d. Instead we first construct a cover with constant error as 
before and then improve the quality of the cover directly. 

Lemma 3.3. For all Boolean functions f : {0, 1}™ x {0, 1}™ — > {0, 1}, and for all 
d>l: 

If there is a quantum protocol for f with communication c and error 1/3, 

then there is a real a G [0,1], and a set of 2°( dc ' rectangles Ri with weights 
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w, 



G {—a, a}, so that 



^2wiRi[x,y] G 



[l-l/2 d ,l] 
[0, l/2 d ] 



for f(x, y) = 1 
for f(x,y) = 0. 



Proof. We start with the result of the previous lemma. The obtained set of 
rectangles approximates the communication matrix within error e for some small 
constant e. Call these rectangles Ri and their weights a, = ±a. 

Doing the same construction for the rejecting part of the final state of the original 
protocol we get a set of 2°(°) weighted rectangles, such that the sum of these is between 
and e on every x,y G and between 1 — e and 1 for every x,y G / _1 (0). Call 

these rectangles R[. Due to the previous construction their weights can be assumed 
to be also a- = ±a. Note that for all x, y : 

J2^Mx,y)) + J2« R ' l (x,y))<i. 

i i 

We construct our new set of rectangles as follows. For every ordered k tuple of 
rectangles containing at least k/2 rectangles Ri and at most k/2 rectangles R[ we 
form a new rectangle by intersecting all of the rectangles in the tuple. The weight of 
the new rectangle is the product of the weights of its constituting rectangles. Now we 
consider the sum of all rectangles obtained this way. 

The number of new rectangles is at most 2°^ ck \ The sum of the weights of 
rectangles adjacent to a zero input x,y of the function is 



< 




2~n(fe) 



for some e > e x>y > (see e.g. lemma 2.3.5 in jl7| for the last inequality). The same 
sum of weights is also clearly at least 0. The sum of the weights of rectangles adjacent 
to a one input x, y of the function is 




> 1 — 2 _n ( fe ) 



for some e > e x<y > 0. The same sum of weights is also clearly at most 1. 

So choosing k = Q(d) large enough yields the desired set of rectangles. □ 
At first glance the covers obtained in this section seem to be very similar to 

majority covers: we have a set of rectangles with either negative or positive weights 
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of absolute value a, and if the weighted sum of rectangles adjacent to some input 
exceeds a threshold, then it is a 1-input. But we have one more property, namely 
that summing the weights of the adjacent rectangles approximates the function value. 
Actually the lower bounds in the next sections and the characterization of majority 
covers (and weakly unbounded error protocols and the discrepancy bound) in section 
8 show that there is an exponential difference between the sizes of the two types of 
covers. 

Now we state another form of the lemma, this time if the error is close to 1/2, 
the proof is essentially the same as for lemma 3.1. 

Lemma 3.4. For all Boolean functions f : {0, 1}" x {0, 1}" — > {0, 1}, and for all 
l/2>e>0: 

If there is a quantum protocol for f with communication c and error 1/2 — e, 
then there is a real a £ [0,1], and a set of 2°^/e rectangles Rj with weights 
Wi G {—a, a}, so that 



Y^WiRi[x,y] G | [of 1/2 



[l/2 + e/2,l] forf(x,y) = l 
e/2] forf(x,y) = 0. 



Note that all results of this section easily generalize to functions with range 
{— 1,+1}. Furthermore all the results generalize to partial functions, i.e., the func- 
tions may be undefined on some inputs. For those inputs the weighted covers produce 
a arbitrary weight between and 1. 

As an application of the decomposition results we now prove fact 2.8. A proof of 
this result seems to be available only in the thesis of Kremer p7| and is stated in less 
generality there, so we include a proof here. 

Proof of fact 2.8. Obviously it suffices to prove the second statement. Let fi be 
any distribution on the inputs. Assume there is a protocol with communication c so 
that the average correctness probability over /j, and the measurements of the protocol 
is at least 1/2 + e. 

Let P(x,y) denote the probability that the protocol accepts x,y and K(x,y) 
denote the probability that the protocol is correct on x,y. W.l.o.g. we assume that 
> m(/ _1 (0))- Then we have 

- fi>( x >y) p ( x >y) 

Y v(x,y) K (x,y) 
+ m(z,i/)a'(s>j/)-M/~ 1 (°)) 

> l/2 + e-l/2 = e. 

Following the construction of lemma 3.4 we get a set of C = 2 0(c ) /e rectangles 
Ri with weights u>; so that the sum of these approximates the acceptance probability 
of the protocol with componentwise additive error e/2. Then 

Y M^jJ/) ^2 w l R t (x,y) 
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53 K X >V) 5Z w i R i (x,y)>e-e/2. 
x-.yef-Ho) i<i<c 



Exchanging sums gives us 




n(x,y)Ri(x,y)- 



J2 li(x,y)Ri(x,y) > e/2 
x-yef-Ho) J 

and 

2 w^irHi) n ft) - m(/ -1 (o) n Jfc)) > e/2. 

l<i<C 

Thus there is a rectangle R, with /i(/ _1 (l) n R t ) - H i2») > (e/2)/C, 

but for all rectangles we have n — ^(/ -1 (0) n J2j) < disc^(f), hence 

disc^f) > (e/2)/C and finally 

9C( C ) / e 
= C > (e/2)/disc p (f) => c > lo : 



disc^f) 



□ 



4. A Fourier Bound. In this section we describe a lower bound method first 
developed by Raz J34j] for classical bounded error communication complexity. We 
prove that the same method is applicable in the quantum case, using the decompo- 
sition results from the previous section. The lower bound method is based on the 
Fourier transform of the function. 

As in section 2.3 we consider the Fourier transform of a communication function. 
The basis functions are labeled by pairs of strings (z, z'). Denote by V the set of all 
pairs (z, z). Let E C V denote some subset of indices of Fourier coefficients. 

The basic idea of the lower bound is that the communication must be large, when 
the sum of the absolute values of a small set of Fourier coefficients is large. 

Theorem 4.1. Let f be a total Boolean function f : {0, 1}™ x {0, 1}™ — > {0, 1}. 

Let E C V . Denote kq = \E\ (the number of coefficients considered) and K\ = 
S(z z)eE (th- e absolute value sum of coefficients considered). Then: 

'ifm > then BQC(f) = n(log(«i)). 

Z/M < 0{^), then BQC(f) = n(tog(/fii)/(log(VK5) - log(«i) + 1)). 

Proof. We are given any quantum protocol for / with error 1/3 and some worst 
case communication c. We have to put the stated lower bound on c. Following 
lemma 3.3 we can find a set of 2°( cd ' ) weighted rectangles, so that the sum of these 
approximates the communication matrix up to error l/2 d for any d > 1, where the 
weights are either a, or —a for some real a between and 1. We will fix d later. Let 
{(Ri,Wi)\l < i < 2°( cd )} denote that set. Furthermore let g{x,y) denote the function 
that maps (x, y) to WiRi(x, y). 

First we give a lower bound on the sum of absolute values of the Fourier coefficients 
in E for g, in terms of the respective sum for /, using the fact that g approximates 
/. Obviously ||/ — g\\% < l/2 d . The identity of Parseval then gives us 



U {f z . z ~g z . z f <\\f~g\\l<^ 

{z,z)£E 
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We make use of the following simple consequence of fact 2.6. 

Fact 4.2. Let \\\v\\\ 2 = VXXi v h and IINK = i M- Then \\\ v ~ w \\h > 
\\\v — w\\\i/y/m > (\\\v\\\i - \ \\w\\\i) / y/m. 
Hence 

E ^ E - ./i^i •£(/*,* -$*,*) 2 

E £ y _B 

Thus the sum of absolute values of the chosen Fourier coefficients of g must be 
large, if there are not too many such coefficients, or if the error is small enough 
to suppress their number in the above expression. Call P — (ki — y^Ko ■ 2~ d ), so 
£ E //-, ' /'. 

Now due to the decomposition of the quantum protocol used to obtain g, the 
function is the weighted sum of C = 2°^ rectangles. Since the Fourier transform is 
a linear transformation, the Fourier coefficients of g are weighted sums of the Fourier 
coefficients of the rectangles. Furthermore the Fourier coefficients of a rectangle are 
the products of the Fourier coefficients of the characteristic functions of the sets 
constituting the rectangle, as argued in section 2.3. So g z ^ z — J2i w i ' &z,i • $z,i and 

(4.i) Ei^.-i <EEi^-^-/M- 

E E i 

For all rectangles Ri we have ^ £ d^ i < ||j4i||l < 1 by the identity of Parseval. 
Using the Cauchy-Schwartz inequality (fact 2.6) we get ^2 E \a z ^fi z ,i\ < 1. But accord- 
ing to (4.1) the weighted sum of these values, with weights between -1 and 1, adds up 
to at least P, and so at least C > P rectangles are there, thus cd = £l(logP). 

If now K\ > f2(y / Ko), then let d — O(l), and we get the lower bound c = 
fi(log(Ki)). Otherwise set d = O(\ogy/Ko — log/ti + 1) to get P — «i/2 as well 
asc = n(log(P)/d)=n(log(/ci)/(log(y^)-log(Ki) + l)). □ 

Let us note one lemma that is implicit in the above proof, and which will be used 
later. 

Lemma 4.3. Let g : {0, 1}" x {0, 1}" — > [—1, 1] be any function such that there is 
a set of Q rectangles Ri with weights Wi G [—1, 1] so that g(x,y) — X)2=i w iRi{ x iU) 
for all x,y. Then 

E \9*,*\<Q- 

££{0,1}™ 

5. Applications. In this section we give applications of the lower bound method. 

5.1. Quantum Nondeterminism versus Bounded Error. We first use the 
lower bound method to prove that nondeterministic quantum protocols may be ex- 
ponentially more efficient than bounded error quantum protocols. Raz has shown the 
following Q: 

Fact 5.1. For the function HAM™^ 2 consider the set of Fourier coefficients with 
labels from a set E containing those strings z,z with z having n/2 ones. Then 

_ ( n \ _ / n\ (n/2\ 1 
K °~ {n/2j' Kl ~ [n/2j In/V^" 
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Thus log(y^Ko) — log(/ci) = O(logn). Also Kj = 6(2™/ 2 /n) and thus log^! = 
9(n). 

Applying the lower bound method we get 
Theorem 5.2. BQC{HAAQ 12 ) = Q(n/\ogn). 

Now we prove that the nondeterministic quantum complexity of HAM„^ 2 is small. 
We use the following technique by de Wolf |}8[ [jlf . 

FACT 5.3. Let the nondeterministic rank of a Boolean function f be the mini- 
mum rank of a matrix that contains at positions corresponding to inputs (x,y) with 
f(x,y) = and nonzero reals elsewhere. Then NQC(f) = \ognrank(f) + 1. 

Theorem 5.4. NQC(HAM^ 2 ) = O(logn). 

Proof. It suffices to prove that the nondeterministic rank is polynomial. Define 
rectangles Mj, which include inputs with Xi = 1 and yi — 0, and iVj, which include 
inputs with X4 = and yi — 1. Let E denote the all one matrix. Then let M = 
+ Ni) — n/2 ■ E. This is a matrix which is exactly at those inputs with 
J2i( x i © Vi) = Furthermore M is composed of 2n + 1 weighted rectangles and 

thus the nondeterministic rank of HAM^ 2 is 0{n). □ 

5.2. The Complexity of the Hamming Distance Problem. Now we deter- 
mine the complexity of HAM„, and show that quantum bounded error communication 
does not allow a significant speedup. 

Theorem 5.5. Let t : IN — ► IN 6e an?/ monotone increasing function with t(n) < 
n/2. Then 

BQC(HAM^) > n f + logn) . 

Proof. We already know that the complexity of HAM^ 2 is fi(n/ logn). Now 
consider functions HAM^ for smaller t. The logarithmic lower bound is obvious from 
the at most exponential speedup obtainable by quantum protocols p7| . 

Fixing n — 2t pairs of inputs variables to the same values leaves us with 2t pairs 
of free variables and the function accepts if HAM\ t accepts on these inputs. Thus 
the lower bound follows. □ 

Theorem 5.6. 

B PC (HAM*) = O(tlogn). 

Proof. The protocol determines (and removes) positions in which x, y are different, 
until no more such positions are present, or until t + 1 such positions are found, in 
both cases the function value can be decided. 

Nisan |}2| has given a protocol in which Alice and Bob, given u-bit strings x,y, 
compute the leftmost bit in which x, y differ. The protocol needs communication 
0(logri — loge) to solve this problem with error e. Hence we can find such a position 
with error l/(3t) and communication O(logn), since t < n. So Alice and Bob can 
determine with error 1/3, whether there are exactly t differences between x and y, 
using communication 0(tlogn) as claimed. □ 

6. More Fourier Bounds. In this section we develop more methods for proving 
lower bounds on quantum communication complexity in terms of properties of their 
Fourier coefficients. Combining them yields a bound in terms of average sensitivity. 
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6.1. A Bound Employing One Fourier Coefficient. Consider functions of 
the type f(x, y) = g{x A y). The Fourier coefficients of g measure how well the parity 
function on a certain set of variables is approximated by g. But if g is correlated with 
a parity (hopefully on a large set of variables), then / should be correlated with an 
inner product function. The hardness result stated in fact 2.8 then gives the intuition 
for the first bound of this section. 

Theorem 6.1. For all total functions f : {0, l}"x{0, 1}™ — > {0, 1} with f(x, y) = 
g(x A y) and all z £ {0, 1}™ : 

BQC(f) = n ( \ zl ) . 
\i - log |a» I / 

Proof. We prove the bound for g with range {—1,1}. Obviously the bound itself 
changes only by a constant factor with this change and the communication complexity 
is unchanged. 

Let z be the index of any Fourier coefficient of g. Let \z\ — m. Basically g z 
measures how well g approximates \z, the parity function on the m variables which 
are 1 in z. Consider the following distribution fi m on {0, l}™ 1 x {0, l} m : Each variable 
is set to one with probability y/l/2 and to zero with probability 1 — y/l/2. Then 
every Xi A yi is one resp. zero with probability 1/2. So under this distribution on the 
inputs (x, y) to / we get the uniform distribution on the inputs z = x A y to g. 

We will get an approximation of IP m under fi m with error 1/2 — |.g z |/4 by taking 
the outputs of a protocol for / under a suitable distribution. We then use a hardness 
result for IP m given by the following lemma. 

Lemma 6.2. Let \x m be the distribution on {0, l} m x {0, 1}"\ that is the 2m-wise 
product of the distribution on {0, 1}, in which 1 is chosen with probability yJX/2. Then 

dis Cllm (IP m ) < 0(2-™/ 4 ). 

Clearly with fact 2.8 we get that computing IP m with error 1/2 — e under the 
distribution fjb m needs quantum communication fi(m/4 + loge). 

Let us prove the lemma. Lindsey's lemma (see e.g. ||) states the following. 

Fact 6.3. Let R be any rectangle with axb entries in the communication matrix 
of IP m ■ Then let 

WRnip-Wl-lRniP^WW^Vriz^. 

The above fact allows to compute the discrepancy of IP n under the uniform 
distribution, and will also be helpful for fi m . 

fi m is uniform on the subset of all inputs x, y containing k ones. Consider any 
rectangle R. There are at most ( 2 ™) inputs with exactly k ones in that rectangle. 
Furthermore if we intersect the rectangle containing all inputs x, y containing i ones 
in x and j ones in y with R we get a rectangle containing at most (™) • ('") < ( 4 2 _[™) 
inputs. In this way R is partitioned into m 2 rectangles, on which /i m is uniform and 
Lindsey's lemma can be applied. Note that we partition the set of inputs with overall 
k ones into up to m rectangles. 
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Let a = y/l/2. The probability of any input with k ones is (1 — a) 2m k ■ a k . We 
get the following upper bound on discrepancy under \i m : 

m 
ij=0 

2m 

ym/2 _ fc _ n _ n \2m-k 



<m2 m / 2 -^a fc -(l- a ) 2 



fe=0 




< m2 m / 2 • V2m + 1 • 



\ 



2m 



, a 2k • (1 - a) im - 2k ■ [ 2m 



< mV2m+ 12 m / 2 (a 2 + (1 - a) 2 )" 1 

< mV2m+ 12 m/2 (2 - V2) m 

< (9(2- m/4 ). 

This concludes the proof of lemma 6.2. 

To describe the way we use this hardness result first assume that the quantum 
protocol for / is errorless. The Fourier coefficient for z measures the correlation 
between g and the parity function Xz on the variables that are ones in z. We first 
show that xi m can be computed with error 1/2 — |<? z |/2 from g (or its complement). 
To see this consider g z = (g,Xz) = J2 a W^^ a ) ' Xz( a )- W.l.o.g. assume that the first 
m variables of z are its ones. So we can rewrite to 



■''--= 12 ^m- 12 ^9(ab)-Xz(ab). 

be{Q,l} n - m a£{0A} m 



Note that Xz depends only on the first m variables. In other words, if we fix a random 
b, the output of g has an expected advantage of \g z \ over a random choice in computing 
parity on the cube spanned by the first m variables. Consequently there must be some 
b realizing that advantage. We fix that b, and use g(ab) (or —g{ab)) to approximate 
Xi m - The error of this approximation is 1/2 — \g z \/2. 

Next we show that IP m resp. Xi m ( x A y) = Xz{{% A y) o b) is correlated with 
g((x Ay) ob) under some distribution. 

Let /i' n be a distribution resulting from //„, if all Xi and j/, for i = m + 1, . . . , n 
are fixed so that Xj A yi = bi- m and all other variables are chosen as for [i n . Then 

I 12 Vn( x ,y)- 9(xAy)-Xz(xAy)\ 

(x,y)e{Q,l} 2 -~ 

= 1 12 9(ab) ■ Xz(ab) ■ ^ V-' n { x ,y)\ 

aG {0,1} m x.y:xAy—ab 

= | £ g(ab)- X z(ab)~\>\9z\. 

oG{0,l} m 

Hence computing / on /j,' n with no error is at least as hard as computing IP m on 
distribution /j, m with error 1/2 — \g z \/2, which needs at least fi(|z|/4 + log \g z \) qubits 
communication due to the discrepancy bound. 

We assumed previously that / is computed without error. Now assume the error 
of a protocol for / is 1/3. Then reduce the error probability to |g z |/4 by repeating the 
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protocol d = 0(1 — log \g z \) times and taking the majority output. Computing / on /j,' n 
with error |<? z |/4 is at least as hard as computing IP m on distribution ji m with error 
1/2 — \g z \/2 + |g z |/4, which needs at least fi(M/4 + log \g z \) qubits communication. 
The error introduced by the protocol is smaller than the advantage of the function / 
in computing IP m . 

So a lower bound of f2(|z|/4 + log \g z \) holds for the task of computing / with 
error |g z |/4. This implies a lower bound of 

»(H/4 + log|g z |) = Q ( \z\ \ 

d \i - lo &\gz\) ' 

for the task of computing / with error 1/3. □ 

Note that the discrepancy of / in the above theorem may be much higher than 
the discrepancy of IP m (leading to weak lower bounds for /) , but that / approximates 
IP m well enough to transfer the lower bound known for IP m (which happens to be 
provable via low discrepancy). 

6.2. A Sensitivity Bound. A weaker, averaged form of the bound in the above 
subsection is the following. 

LEMMA 6.4. For all functions f : {0,1}™ x {0,1}™ — > {-1,1} with f(x,y) = 
g(x A y) : 

Proof. First note that s(g) — J2 Z 9z\ z \ by fact 2.10. So we can read the bound 

The g z define a probability distribution on the z e {0, 1}™. If we choose a z 
randomly then the expected Hamming weight of z is s(g). Also the expectation of 
1 — 2 log \g z \ is 1 + H(g 2 ). We use the following lemma. 

Lemma 6.5. Let ai, . . . , a m be nonnegative and bi, . . . , b. m be positive numbers 
and letpi,..., p m be a probability distribution. Then there is an i with: 

J2jPj a 3 

h - ~ y.,i>A, ' 

To sec the lemma let a — J2jPj a j an( i ^ = ^IjPj^j an( i assume that for all i 
we have aib < bia. Then also for all i with pi > we have Piaib < pibia and hence 
b^2iPia.i < aJ2iPibi, a contradiction. 

So there must be one z, such that \z\/ (1 — log g 2 z ) > s(g)/(l + H{g 2 )). Using that 
z in the bound of theorem 6.1 yields the lower bound. □ 

The above bound decreases with the entropy of the squared Fourier coefficients. 
This seems unnecessary, since the method of theorem 4.1 suggests that functions with 
highly disordered Fourier coefficients should be hard. This leads us to the next bound. 

Lemma 6.6. For all functions f : {0, 1}™ x {0, 1}" -» {-1, 1} : 
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where H D (p) = -j:Jl z \ogfl z . 

Proof. Consider any quantum protocol for / with communication c. As described 
in lemma 3.3, we can find a set of 2°( clog ™) weighted rectangles so that their sum 
yields a function h(x,y) that approximates / entrywise within error 1/n 2 . 

Consequently due to lemma 4.3, the sum of certain Fourier coefficients of h is 
bounded: 

log £ \h z ,z\<0(c\ogn). 
ze{o,i}" 

Also -J2ze{o i}" h l.z^gh 2 z z < 21og(l + ]T^ e{ o 1} „ \h z ,z\) ^ O(clogn) due to lemma 
2.12. 

But on the other hand ||/ — /i|| 2 < l/^ 2 , which we will use to relate i?_o(/ 2 ) to 
Hu{h 2 ). We employ the following lemma. 

Lemma 6.7. Let f, h : {0, 1}™ x {0, 1}™ -» IR with ||/|| 2 , ||ft|| 2 < 1- Then 

E l/^-^Ll<3||/-/ i || 2 . 

zG{0,l}™ 



Let us prove the lemma. Define 

Min z - 

and 

Max, -- 



fz,z if \fz.z\ < \hz.z 

h z , z if \h z , z \ < \fz.z 



fz,z if \fz,z\ > \hz, z \ 

h z ,z if \h z .z\ > \fz,z\- 
Then E ze{ o,i}" \flz - %,z\ = E 2 Ma A - Mini and 

11/ -h\\\> £(/,,, - k ^) 2 = E( Mm - - Max ^ 

Z Z 

Due to the triangle inequality we have 



IJ2Mtnl + \\f-h\\ 2 > J2 Max * 



and 



which implies 



£ Mm 2 z > £ Max 2 z - 2 /£ Mar* ■ 1 1/ - p| | 2) 
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and 



Max 2 z - Mini < 2,£ Max\ • || / - fc|| a 

Z V z 

<2^£/»,+%,.||/-fc|| a 
<2V2\\f~h\\ 2 . 

Lemma 6.7 is proved. 

So the distribution given by the squared z, z-Fourier coefficients of / is close to 
the vector of the squared z, z-Fourier coefficients of h. Then also the entropies are 
quite close, by the following fact (see theorem 16.3.2 in p4[). 

Fact 6.8. Letp,q be distributions on {0, 1}™ with d = \p z — q z \ < 1/2. Then 
\H(p)-H{q)\<d-n-d\ogd. 

Actually the fact also holds if p, q are subdistributions, i.e., if they consist of 
nonnegative numbers summing up to at most 1. 

So we get 

H D (h 2 ) >H D (f 2 )-0(l/n). 
Remembering that Hrj(h) — O(clogn) we get 

H D (f 2 ) < 0(clogn + l/n). 

This concludes the proof. □ 

If f(x,y) = g(x © y), then H D (f 2 ) = H(f 2 ) = H(g 2 ). Now we would like to 
get rid of the entropies in our lower bounds at all, since the entropy of the squared 
Fourier coefficients is in general hard to estimate. Therefore we would like to combine 
the bounds of lemmas 6.4 and 6.6. The first holds for functions g(x A y), the second 
for functions g{x © y). 

Definition 6.9. A communication problem / : {0, 1}™ x {0, 1}" — > { — 1, 1} can 
be reduced to another problem h : {0, l} m x {0, l} m — > {—1, 1}, if there are functions 
a, b so that f(x, y) = h(a(x), b(y)) for all x, y. 

In this case the communication complexity of h is at least as large as the commu- 
nication complexity of /. Note that if m is much larger than n, a lower bound which 
is a function of n translates into a lower bound which is a function of m, and is thus 
"smaller" . For more general types of reductions in communication complexity see || . 

If we can reduce g{x A y) and g{x © y) to some /, then combining the bounds of 
lemmas 6.4 and 6.6 gives a lower bound of Q(s(g)/(1 + H{g 2 )) + H (g 2 )/\ogn), which 
yields corollary 1.2. 

6.3. A Bound Involving Singular Values. We return to the technique of 
lemma 6.6. For many functions, like IP m , the entropy of the squared diagonal Fourier 
coefficients is small, because these coefficients are all very small. We consider the 
entropy of a vector of values that sum to something much smaller than 1 in cases. 
Consequently it may be useful to consider other unitary transformations instead of 
the Fourier transform. 

It is well known that any quadratic matrix M can be brought into diagonal form 
by multiplying with unitary matrices, i.e., there are unitary U, V so that M = UDV* 
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for some positive diagonal D. The entries of D are the singular values of M, they are 
unique and equal to the eigenvalues of VMM* , see (?]] . 

Consider a communication matrix for a function / : {0, 1}" x {0, 1}" — > { — 1, 1}. 
Then let Mf denote the communication matrix divided by 2™. Let oi(/), . . . , 02™ (/) 
denote the singular values of Mf in some decreasing order. In case Mf is symmetric 
these are just the absolute values of its eigenvalues. Let <J 2 (f) denote the vector of 
squared singular values of Mf. Note that the sum of the squared singular values is 1. 
The following theorem is a modification of lemma 6.6 and theorem 4.1. 

Theorem 6.10. Let f : {0, 1}™ x {0, 1}" — > {— 1, 1} be a total Boolean function. 

Then BQC(f) = Q(H(a 2 (f))/ logra). 

Let Kk = o-i(f) H h a k (f). 

Ifn k > Cl(Vk), then BQC(f) = fi(log(/s fc )). 

// K k < O(Vk), then BQC(f) = fi(logM/(log(Vfc) " Iog(«*) + !))• 
Proof. We first consider the entropy bound and proceed similarly as in the proof 
of lemma 6.6. Let / be the considered function and let h be the function computed 
by a protocol decomposition with error 1/n 2 consisting of P rectangles with log P = 
0(c log n) for the communication complexity c of some protocol computing / with 
error 1/3. 

Mf denotes the communication matrix of / divided by 2™, let Mh be the corre- 
sponding matrix for h. Using the Frobenius norm on the matrices we have \\Mf — 
MhWp — \\f — h\\2 < 1/n 2 . Then also the singular values of the matrices are close due 
to the Hoffmann- Wielandt theorem for singular values, see corollary 7.3.8 in p(i| ]. 

Fact 6.11. Let A,B be two square matrices with singular values o~\ > ■ ■ ■ > o~ m 
and fj,x > • • • > fi rn . Then 

Ry^^<\\A-B\\ F . 



As in lemma 6.6 we can use lemma 6.7 to show that the Li-distance between 
the vector of squared singular values of Mf and the corresponding vector for Mh is 
bounded and fact 6.8 to show that the entropies of the squared singular values of Mf 
and Mh are at most o(l) apart. 

It remains to show that H(a 2 (h)) is upper bounded by logP. Due to lemma 2.12 
H(a 2 (h)) < 21og(l + ai(h)). Due to the Cauchy Schwartz inequality we have 

21og(l + ^a i (/i)) 



< 2 log 1^ a 2 (h) y/rank(M h ) + 0(1) 

< log rank(M h ) + O(l) < logP + 0(l). 

The last step holds since Mh is the sum of P rank 1 matrices. We get the desired 
lower bound. 

To prove the remaining part of the theorem we argue as in the proof of theorem 
4.1 that the sum of the selected singular values of Mh is large compared to the sum 
of the selected singular values of Mf , then upper bound the former as above by the 
rank of Mh and thus by P. The remaining argument is as in the proof of theorem 
4.1. □ 
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Note that for IP n all singular values are 1/2™/ 2 , so the entropy of their squares 
is n, while the entropy of the squared diagonal Fourier coefficients is close to 0, since 
these are all (IP n ,Xz.z) 2 = l/2 2n . The log of the sum of all singular values yields a 
linear lower bound. In this case the bounds of lemma 6.6 and theorem 4.1 are very 
small, while theorem 6.10 gives large bounds. 

Ambainis j[) has observed that theorem 6.10 can also be deduced from a lower 
bound on the quantum communication complexity of sampling Q , using success am- 
plification and an argument relating the smallest number of singular values whose 
sum is at least 1 — K 2 /(4fc) to the sum of the first k singluar values in the presence of 
sufficiently small error. 

Note that theorem 6.10 does not necessarily generalize our other bounds in the 
sense that the results obtained by using theorem 6.10 are better for all functions. 

We mention that the quantity <7i + • • • + Cfe is known as the Ky Fan fc-norm of 
a matrix JtJ. Well known examples of such norms are the cases k — 1, which is the 
spectral norm, and the case of maximal k, known as the trace norm. The Ky Fan 
norms are unitarily invariant for all fc, and there is a remarkable fact saying that if 
matrix A has smaller Ky Fan fc-norm than B for all k, then the same holds for any 
unitarily invariant norm. This leads to the interesting statement that the Raz-type 
bound in theorem 6.10 for a function g is smaller than the respective bound for / 
for all k, iff for all unitarily invariant matrix norms |||M g ||| < Under the 

same condition the distribution (cr 2 (/), . . . , cr|„(/)) induced by the singular values of 
Mf majorizes the distribution (a 2 (g), . . . , cr|"(5)) induced by M g . This implies that 
H(o~ 2 (f)) < H(a 2 (g)). Conversely, considering the bounds in theorem 6.10: if the 
entropy bound for g is smaller than the entropy bound for /, then there is a k, so 
that the Raz type bound for k applied to g is bigger than the corresponding bound 
for /. 

6.4. Examples. To conclude this section we give examples of lower bounds prov- 
able using the methods described by theorem 6.1 and corollary 1.2. 
Theorem 6.12. BQC(MAJ n ) = Q(n/logn). 

Proof. We change the range of MAJ n to {— 1,+1}. Now consider the Fourier 
coefficient with index z = l n . MAJ n = g(x A y) for a function g that is 1, if at least 
n/2 of its inputs are one. W.l.o.g. let n/2 be an odd integer. Thus any input to g with 
n/2 ones is accepted by both g and Xz- Call the set of these inputs /. Similarly every 
input to g with an odd number of ones larger than n/2 is accepted by both d and \z 
and every input to g with an even number of ones smaller than n/2 is rejected by both 
d and \ z . On all other inputs g and Xz disagree. Thus there are |7| inputs more being 
classified correctly by Xz than those being classified wrong. The Fourier coefficient 
g z is 2( n ™ 2 j/2™ = n(l/y/n). So the method of theorem 6.1 gives the claimed lower 
bound. □ 

Note also that the average sensitivity of the function g with MAJ n (x 1 y) — g(xAy) 

is e(VH). 

As another example we consider a function g(x A y © z) with a nonsymmetric 
g. Let MED(a) be the middle bit of the median of n/(21ogn) numbers of 21ogn 
bits given in a. Let us compute a lower bound on the average sensitivity of MED. 
For all inputs a there are 0(n/logn) numbers bigger than the median and smaller 
than the median each. For each number p different from the median we can switch 
a single bit to put the changed number below resp. above the median, shifting the 
median in the sorted sequence by one position. For a random a such a bit flip entails 
a change of the middle bit of the median with constant probability. Hence the average 
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sensitivity of MED is at least f2(n/logn). With corollary 1.2 this gives us a lower 
bound of ft(y/ri/ logn) on the bounded error quantum communication complexity of 
MED(xAy(Bz). 

7. Application: Limits of Quantum Speedup. Consider COU NT^{x : y). 
These functions do admit some speedup by quantum protocols, this follows from a 
black box algorithm given in ^ (see also 0), and the results of jit)) connecting the 
black box and the communication model. 

Lemma 7.1. BQC(COUNT^) = 0{^fnl\ogn). 

Note that the classical bounded error communication complexity of all COUNT^ 
is 0(n), by a reduction from DISJ n . 

Theorem 7.2. Let t : IN — > IN be any monotone increasing function with t(n) < 
n/2. Then 

BQC(COUNT^) > n ( + lo S») • 

V log t(n) J 

Proof. First consider COU NT n . This function is equivalent to a function 
g(x A y), in which g is 1 if the number of ones in its input is n/2, and —1 else. 
Consider the Fourier coefficient for z = 1". For simplicity assume that n is even and 
n/2 is odd. Then clearly g z = 2( ?i ™ 2 )/2™ = Q(l/^/n). Thus the method of theorem 
6.1 gives us the lower bound f2(n/logn). Note that finding this lower bound is much 
easier than the computations in section 5 for HAMn^ 2 , since we have to consider only 
one coefficient. 

Now consider functions COUNT^ for smaller t. The logarithmic lower bound is 
obvious from the at most exponential speedup obtainable by quantum protocols [27j] . 

Fixing n/2 — t pairs of inputs variables to ones and n/2 — t pairs of input variables 
to zeroes leaves us with 2t pairs of free variables and the function accepts if COU NT\ t 
accepts on these inputs. Thus the lower bound follows. □ 

Computing the bounds for t = n 1_e yields corollary 1.3. 

8. Discrepancy and Weakly Unbounded Error. The only general method 
for proving lower bounds on the quantum bounded error communication complexity 
has been the discrepancy method prior to this work. We now characterize the param- 
eter disc(f) in terms of the communication complexity of /. Due to fact 2.8 we get 
for all e > 

BQC 1/2 _ e (f) = Q(log(e/ disc(f))) 
=> BQC 1/2 _ e (f) - log(e) = n(log(l/dwc(/))). 
Thus PC(f) > QC(f) = n(log(l/d» aC (/))). 

Theorem 8.1. For all f : {0, 1}" x {0, 1}" -> {0, 1} : 
PC{f) = 0(log(l/ disc(f)) + logn). 

Proof. Let disc(f) = 1/2 C . We first construct a protocol with public random- 
ness, constant communication, and error 1/2 — 1/2 C+1 , using the Yao principle, and 
then switch to a usual weakly unbounded protocol (with private randomness) with 
communication 0(c + logn) and the same error using a result of Newman. 

We know that for all distributions /i there is a rectangle with discrepancy at least 
1/2 C . Then the weight of ones is a + 1/2 C+1 and the weight of zeroes is a — 1/2 C+1 or 
vice versa on that rectangle (for some a£ [0,1/2]). 
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We take that rectangle and partition the rest of the communication matrix into 2 
more rectangles. Assign to each rectangle the label or 1 depending on the majority 
of function values in that rectangle according to /i. The error of the rectangles is at 
most 1/2. If a protocol outputs the label of the adjacent rectangle for every input, 
the error according to // is only 1/2 — 1/2 C+1 . 

This holds for all /i. Furthermore the rectangle partitions lead to deterministic 
protocols with 0(1) communication and error 1/2 — 1/2 C+1 : Alice sends the names 
of the rectangles that are consistent with her input. Bob then picks the label of the 
only rectangle consistent with both inputs. 

We now invoke the following lemma due to Yao (as in p^j). 

FACT 8.2. The following statements are equivalent for all f : 

1. For each distribution fi there is a deterministic protocol for f with error e 
and communication d. 

2. There is a randomized protocol in which both players can access a public 
source of random bits, so that f is computed with error probability e ( over the random 
coins), and the communication is d. 

So we get an O(l) communication randomized protocol with error probability 
1/2 — l/2°( c ) using public randomness. We employ the following result from J|(| to 
get a protocol with private randomness. 

Fact 8.3. Let f be computable by a probabilistic protocol with error e, that 
uses public randomness and d bits of communication. Then BPCn + fi\ e (f) — 0(d + 

l°g(3))- 

We may now choose 8 = l/2°( c ' small enough to get a weakly unbounded error 
protocol for / with cost 0(c + logn). 

Let us also consider the quantum version of weakly unbounded error protocols. 
Theorem 8.4. For all f: PC(f) = 9(QC(/)). 

Proof. The lower bound is trivial, since the quantum protocol can simulate the 
classical protocol. 

For the upper bound we have to construct a classical protocol from a quantum 
protocol. Consider a quantum protocol with error 1/2 — e < 1/2 — 1/2 C and communi- 
cation c. Due to lemma 3.4 this gives us a set of 2°^ weighted rectangles, such that 
the sum of the rectangles approximates the communication matrix entrywise within 
error 1/2 — e/2. The weights are real ±a with absolute value smaller than 1. Label 
the —a weighted rectangles with and the other rectangles with 1, and add (l/2)/a 
rectangles covering all inputs and bearing label 0. This clearly yields a majority cover 
of size 2°( c ), which is equivalent to a classical weakly unbounded error protocol using 
communication 0(c) due to fact 2.3. □ 

It is easy to see that there are weakly unbounded error protocols for HAM^, 
MAJ n , and COUNT^ with cost O(logn). MAJ n is even a complete problem for the 
class of problems computable with polylogarithmic cost by weakly unbounded error 
protocols. 

Lemma 8.5. For f e {MAJ n , HAM^, COUNT*} : max M \og(l /disc^f)) = 
O(logn). 

9. Discussion. In this paper we have investigated the problem of proving lower 
bounds on the bounded error quantum communication complexity. As opposed to 
previous approaches our methods are both general and make use of the quantum 
properties of the protocols (i.e., do not follow the scheme of simulating a bounded 
error quantum protocol by an unbounded error classical protocol and employing a 
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lower bound method for the latter). Our results are strong enough to show separa- 
tions between unbounded error classical and bounded error quantum communication 
resp. between quantum nondeterministic and quantum bounded error communication. 
Our results do not address the more powerful model of quantum communication 



complexity with prior entanglement 112, 13J. It would be interesting to obtain similar 
results for this model. Recently an improved lower bound (compared to |]l3| ) for the 
complexity of IP n in this model has been obtained in [p9[ . These bounds do not show 
hardness under a distribution like in the second statement of fact 2.8, though. So 
constructions similar to that of theorem 6.1 remain unknown for the model with prior 
entanglement. 

More recently Razborov |36| has obtained much stronger lower bounds on the 
quantum communication complexity of g(x A y) for symmetric functions g, almost 
tightly characterizing the quantum bounded error communication complexity of these 
functions, even in the model with prior entanglement. This give a Q(y/n) lower bound 
for DISJ n , previously super logarithmic bounds for this function were known only for 
the cases when strong restrictions on the interaction are imposed ]26[ ] or when the 
error probability is extremely small pl[ | . Razborov's techniques are based on showing 
good lower bounds on the minimal trace norm (sum of singular values) of matrices 
approximating the communication matrix, similar to the approach in theorem 6.10. 
These new results can be used to show that in our corollary 1.3 actually the upper 
bounds for COUNT^ are tight. 

The lower bound methods of this paper can also be applied to other types of 
functions, see sections 5 and 6.4. It would be interesting to find tighter lower bounds 
for these functions and to extend our results to the model with prior entanglement. 

Finally we would like to know if quantum bounded error communication can ever 
be more than quadratically smaller than classical bounded error communication for 
total functions. A first step to resolve this problem would be to show a lower bound 
in terms of (one-sided) block sensitivity on the quantum bounded error complexity of 
all functions g(x A y) (with nonsymmetric g). 

Acknowledgements. The author wishes to thank Ronald de Wolf for bringing 
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pointing out a mistake in an earlier version of the paper. 
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